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SELF-*REPLICATXN6 EPISOHAIi EXPRESSION VECTORS 
CONFERRING TISSUE-SPECIFIC GENE EXPRESSION 

General Field of tlie invention 

5 The invention relates to stable self-replicating episoroal 
expression vectors for expressing a gene of interest in a 
host cell. 

Backgroiud of the Invention 

The expression of a foreign gene in a host cell is 
10 generally achieved by transferring the gene into the host 
cell using a gene transfer vector. Gene transfer vectors 
are available in the art and include for example retrovirus 
vectors, adenoviral vectors and adenoassociated viral 
vectors. Many transfer vectors operate by integrating at 
15 least the transferred gene, if not the complete gene 
transfer vector, into the host cell chromosome, although 
non-integrating transfer vectors are known in the art. The 
efficiency of stable integration of transfected gene 
constructs is generally very inefficient (1: 10^ - lo* ) . The 
20 disadvantage of using viral vector-based gene transfer is 
that the amount of genetic material that the vector is able 
to accommodate is limited by the genetic packaging 
limitations of the virus. Gene transfer vectors which 
include large fragments of inserted genetic material are 
25 difficult to produce at a sufficiently high titre to be of 
practical value. Therefore, it is difficult to include 
additional genetic material in a virus-based vector, for 
example, gene regulatory elements, without deleteriously 
affecting stable gene transfer. 
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liocus Control Regions (LCRs) (Grosveld et al.. Cell 51:975- 
985 (1987)), also known as Dominant Activator Sequences, 
Locus Activating Regions or Dominant Control Regions, confer 
tissue-specific, integration-site independent copy number- 
5 dependent expression on a linked gene that has been 
integrated into the chromosome of a host cell. LCRs were 
originally discovered in the human globin gene system, which 
exhibited strong position effects when integrated into a 
chromosome of a host cell in a tissue of a transgenic mouse 
10 or a mouse erythro-leukaemia (MEL) cell (see, for example, 
Magram et al.. Nature 315:338-340 (1985); Townes et al., 
EMBO J. 4:1715-1723 (1985); Kollias et al. , Cell 46:89- 
94(1986); Antoniou et al., EMBO *7. 7:377-384 (1988)). 
Position effects were overcome when the LCRs were linked 
15 directly to such transgenes (Grosveld et al., supra). Other 
LCRs have since been identified, including the B-globin LCR 
(fiLiCR) which promotes gene expression in erythroid tissue 
and the CD2 LCR which promotes gene expression in T cells 
(see, for example. Greaves et al.. Cell 56:979 (1989), 
20 European Patent Application EP-A-0 668 357) , the macrophage- 
specific lysozyme LCR (Bonifer et al., (1985, 1990)), and 
a class II MHC LCR (Carson et al.. Nucleic Acids Res. 
21,9:2065-2072 (1993) ) . 
It is an object of the invention to provide a stable gene 
25 transfer system which, when present in a host cell, confers 
stable and tissue-restricted expression of a gene of 
interest carried in the vector* 



Summary of the Invention 
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This invention provides self-replicating, LCR-containing, 
episomal expression vectors into which a gene of interest is 
inserted for expression of the gene in cells of a specific 
tissue-type. 

5 The invention therefore encompasses a self-replicating 
episomal DNA expression vector for expressing a gene of 
interest in a host cell in a tissue-restricted manner, the 
vector comprising: (a) an origin of replication capable of 
directing replication of the DNA expression vector in cells 
10 of the specific type of tissue; and (b) an LCR, or 
component thereof, which when operatively linked to a gene 
of interest and present in a host cell directs expression of 
the gene in a tissue-restricted manner. 
A vector according to the invention also may include the 
15 gene of interest inserted into a cloning site and 
operatively linked to the LCR. 

The term **gene" is used to define any DNA sequence capable 
of being expressed. The gene of interest may be a foreign 
or heterologous gene, that is, a gene that is either not 
20 normally found in the genomic DNA of the host cell or is not 
normally expressed in that host cell. The gene of interest 
also may be an artificial DNA sequence. 

The tissue-restricted expression of a gene of interest is 
mediated in vectors of the invention using an appropriate 
25 LCR, or components or portions thereof, which confer tissue- 
type specific expression on the gene of interest. 

As used herein, a locus control region" (LCR) is defined 
as a genetic element which is obtained from a tissue- 
specific locus of a eukaryotic host cell and which, when 
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linked to a gene of interest: and integrated into a 
chromosoiae of a host cell, confers tissue-specific, 
integration-site independent (position independent) , copy 
number-dependent expression on the gene of interest. An LCR 
5 that is useful according to the invention possesses these 
characteristics when integrated into chromosomal DKA, and 
will retain the ability to confer tissue-type restricted 
expression of a linked gene when present in a self- 
replicating episomal vector according to the invention. A 

10 "component" of an liCR refers to a portion of an LCR that 
also confers tissue-restricted gene expression when linked 
to a gene and integrated into a self -replicating episomal 
vector. An LCR may be identified structurally in that it 
is associated with one or more DNase X hypersensitive sites 

15 in its natural chromosomal context, and a component of an 
LCR useful according to the invention will also encompass at 
least one DNase I hypersensitive site. 

An "enhancer" is defined herein as a genetic element which 
increases the level of transcription of a linked gene when 

20 present on a self -replicating episomal vector, but which 
does not confer tissue-specific gene expression when present 
on the vector. 

It is preferred that the vector comprises a component of 
an LCR which confers tissue-specific expression and contains 
25 at least one DNase I hypersensitive site. In the human /9- 
globin LCR, the preferred component of the LCR consists 
essentially of HS3 ; also preferred is the human )8-globin LCR 
excluding site HS2; also preferred are sites HS3 and HS4 
together without site HS2. 
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The invention also encompasses a pair of vectors comprising 
a self-replicating episomal DNA expression system for 
expressing a gene of interest in a host cell in a tissue- 
restricted manner, the pair of vectors comprising: a first 
5 vector comprising (a) an origin of replication; 

(b) an LCR, or component thereof, which when operatively 
linked to a gene of interest and present in a host cell 
directs expression of the gene in a tissue-restricted 
manner; and 

10 (c) a Cloning site for a gene of interest; and a second 
vector comprising (d) an origin of replication; and (e) a 
sequence encoding a replication protein, the replication 
protein being necessary for replication of the origin of 
replication. 

In another embodiment, the second vector also may include 
an LCR, Which may be the same LCR as is present on the first 
vector or may be a different LCR which specifies the same or 
at least an overlapping tissue specificity as the first LCR 
such that the gene of interest and the viral replication 
20 gene are expressed in some of the same cells. 

It is preferred that for red cell-restricted gene 
expression, the ^-globin LCR from the ^-globin locus be 
used. AS used herein, -red cells- refer to cells of 
erythroid lineage. 

It is preferred that for T-cell restricted gene expression, 
the CD2 LCR from the CD2 locus be used, or a component 
thereof containing at least one DNase I hypersensitive site 
that directs T-cell restricted gene expression in an 
episomal context. it is preferred that for class li mhc- 



15 



25 
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bearing cell restricted gene expression the class II MHC LCR 
be used, or a component thereof containing at least one 
DNase I hypersensitive site that directs MHC-bearing cell 
restricted gene expression in an episomal context. It also 
5 is preferred that for macrophage cell restricted gene 
expression the macrophage/ lysozyme LCR be used, or a 
component thereof containing at least one DNase I 
hypersensitive site that directs macrophage-cell restricted 
gene expression in an episomal context. 

10 The term ** episomal vector" refers to a nucleic acid vector 
which may be linear or circular, and which is usually 
double-stranded in form. A vector according to the 
invention is generally within the size range of ikb - 
l,000kb, the preferred size range being on the order of 5kb 

15 - lOOkb. 

The terms " self -replicating" , "stably maintained" and 
"persistence" are defined herein as follows. The self- 
replicating function of a vector of the invention enables 
the vectors to be stably maintained in cells, independently 

2 0 of genomic DNA replication, and to persist in progeny cells 
for three or more cell divisions without a significant loss 
in copy number of the vector in the cells, i.e., without 
loss of greater than an average of about 50% of the vector 
molecules in progeny cells between a given cell division. 

25 This self -replicating function is provided by using a viral 
origin of replication and providing one or more viral 
replication factors that are required for replication 
mediated by that particular viral origin. Origins of 
replication and, if necessary, any replication factors may 
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be used from a variety of viruses, including Epstein-Barr 
virus (EBV) , human and bovine papilloma viruses, and 
papovavirus BK, 
In a preferred embodiment, the viral origin of replication 
5 is the oriP of EBV and the replication protein factor is the 
trans-acting EBNA-1 protein. EBNA-1 may be provided by 
expression of the EBNA-1 gene on the same episomal 
expression vector carrying OriP or on another vector in the 
cell or from an EBNA-1 gene in the genomic DNA of the host 
10 cell. 

In a preferred embodiment, a gene encoding a viral 
replication factor may be present on the self-replicating 
episomal vector, i.e., on either the same vector that 
carries the gene of interest or on another vector of a pair 

15 of vectors, and operatively linked to an LCR. 

As used herein, "linked" refers to a cis-linkage in which 
the gene of interest and/or the gene encoding a viral 
replication factor and the LCR are contained in the same 
vector and thus present in cis on the same DNA, and 

20 -operatively linked" refers to a cis linkage in which the 
gene of interest and/ or viral replication gene is subject to 
tissue-restricted expression via the LCR. 

Optionally, a transcription terminator may be introduced 
into a vector of the invention, preferably between the LCR, 

25 or component thereof, and the promoter of the gene of 
interest or the gene(s) encoding viral replication 
protein (s) to prevent undesirable transcription of these 
gene from other promoters that may be present on the 
vectors. Alternatively, a transcription terminator may be 
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introduced upstiream of the LCR and downstream of the gene of 
interest or viral replication gene(s) . 

The episomal expression vectors of the invention may be 
delivered to cells in vivo, bx vivo, or in vitro by any of 
5 a variety of the methods employed to deliver DNA molecules 
to cells. The vectors may also be delivered alone or in the 
form of a pharmaceutical composition that enhances delivery 
to cells in the body. 
In a preferred embodiment, the vectors of this invention 
10 are used in gene therapy to express a therapeutically useful 
protein in the cells of a specific diseased tissue, 
including tumor tissue, in the body. 

Vectors of the invention also are used to express a 
therapeutically useful protein in cells of a specific 
15 tissue-type in vitro. 

In another embodiment of this invention, the vectors 
described herein are used to produce a type of transgenic 
animal in which a foreign or heterologous gene is expressed 
only in a specific tissue type, as directed by the LCR, or 
20 component thereof, incorporated into the vector. The self- 
replicating function of the vector ensures that the vector 
will be passed on to the progeny of the transgenic animal. 
Such transgenic animals are particularly useful in testing 
the fidelity and efficacy of tissue-specific gene expression 
25 prior to clinical treatment of humans* 

In another embodiment of the present invention there is 
provided a method for identifying an LCR or component 
thereof which when comprised in an episomal DNA expression 
vector, operatively linked to a gene of interest and present 
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in a host cell directs expression of said gene in a tissue- 
restricted manner, comprising: 

i. testing the LCR or component thereof by transfecting 
5 an episomal vector containing the candidate LCR or 

component thereof operatively linked to a marker gene 
into a cell line in which the LCR when integrated is 
active and also into a cell line in which the LCR when 
integrated is inactive; and 

10 

ii. identifying the LCR or component thereof which is only 
active in the cell line in which the LCR when 
integrated is active. 

15 Further features and advantages of the invention are found 
in the following drawings, description and claims. 

Brief Description of the Drawings 

Fig. 1 shows a schematic map of an EBV based self- 
20 replicating expression vector p220.2. p220.2 is a 8952 bp 
plasmid which encodes EBNA-1, OriP and hygromycin 
resistance. It replicates as plasmid in 143 and HeLa 
cells. EBNA- I in this construct is driven off an unknown 
promoter located in the pBR322 sequences. DNA inserted 
25 upstream of EBNA-1 appears to eliminate expression of EBNA- 
1. bp 1-35 are pBR322. bp 36-2646 are EBV EBNA- 1 107567- 
110176 (Baer et al.. Nature., 310:207-211, 1984) BamHI- 
PvuII fragment. The BAMHI site was blunt-end ligated to the 
Hindlll site. bp 2647-4826 are EBV OriP 7333-9516 SphI- 
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Sstll sites blunt-end ligated to the BstElI site (Sugden at 
al., MCB, 5:410-413, 1985). bp 4827-5460 are HSV TK 
regulatory region (McKnight, NAR, 8:5949-5964, 1980). bp 
6488-6747 are a HSV TK PvuII fragment ligated into 
5 poisonless pBR322 at Nael site. This site is lost in 
cloning, bp 5461- 6487 are the HPH gene (Gritz and Davies, 
Gene^ 25:179-188, 1983) BamHI fragment blunt end ligated 
into the Smal and Bglll sites in HSV TK sequences, bp 6748- 
8952 are pBR322 poisonless vector (deletion of l. 1 kb in 

10 pBR322) confers ampicillin resistance (Lusky & Botchan, 
NaturB 293:79-81, 1981). The polylinker form pUC 12 (Smal- 
Haelll fragment) is inserted into a Narl site within the HSV 
TK sequences. (/) denotes blunt-end ligations. 
Fig. 2 shows a reporter gene construct: 

15 a) the )3-globin gene extending from a 5' Hpa I site at - 
815 bp to an EcoRV site 1685 bp past the poly A addition 
site in the plasmid GSE1758 (Talbot el al., EMBO J. 9:2169- 
2178, (1990)) was removed as a 4.1 kb EcoRV fragment and 
inserted into a blunted Sail site in the polylinker of 

20 p220.2 (Fig. 1). This cloning step brings a number of extra 
restriction sites (including a unique Sail site) 5» of the 
/3-globin gene. 

Fig. 3 shows )SI-Globin LCR Hypersensitive site constructs. 
Constructs which contain more than one hypersensitive site 
25 were designed such that the site order matches that of the 
wild type /S-globin locus. Sail linkers were added to both 
the 5 ' and 3 ' ends allowing the DNA to be cloned into the 
unique Sail site upstream of the /3-globin gene in the 
p220.2 reporter vector (Fig. 2). 
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Fig . 4 shows: 

a) An autoradlogram of an SI nuclease analysis of RNA from 
K562 cells transfected with self -replicating, episomal, 
expression vectors containing a human i9-globin reporter 
5 gene operatively linked to various combinations of HS 
components from the )9-globin LCR. Lane p 1 ()9-globin gene 
alone) , lanes 2fi l and 2 (HS2 operatively linked to 
globin gene) , lanes 3/9 1 and 2 (HS3 operatively linked to 
^-globin gene) , lanes 4/5 1 and 2 (HS4 operatively linked 
10 to /3-globin gene) , lanes 4/9 1 and 2 (HS4 operatively 
linked to )8-globin gene), lanes 23^3 1 and 2 (HS2/HS3 
combination operatively linked to /3-globin gene) , lanes 34/3 
1 and 2 (HS3/HS4 combination operatively linked to ^- 
globin gene), lanes 24/8 1 and 2 (HS2/HS4 combination 
15 operatively linked to /9 -globin gene) , lanes 2 34/9 l and 2 
(HS2/HS3/HS4 combination operatively linked to /9-globin 
gene) . Lane +Gamma refers to untransfected K562 cells as 
a negative control, and lane +beta refers to MEL cells 
transfected with a ^-globin gene plus ^LCR combination 
20 which acts as a positive control. All lanes are 
independent transf ections. 

b) End labelled DNA probes from the 5 '-region of the 
^-globin gene is shown schematically including the SI 
protected fragments. 
25 Fig. 5 presents autoradiograms of RNA blots from K562 or 
HeLa cells containing constructs described herein. In Panel 
A: lane M is a Hlnfl digest of pBR322 size markers; lane P 
is RNA from MEL cells stably transfected with the human /8- 
globin gene as a positive control; lane /? is the human /3- 
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globin gene alone; lane 2p Is the human ^-globin gene under 
the control of /9-globin LCR HS2; lane 2fi is the human fi- 
globin gene under the control of )3-globin LCR HS3 and lane 
5/S is the human /S-globin gene under the control of /3-globin 
5 LCR HS2, HS3 and HS4 ; and lane N is untransf ected K562 cells 
as a negative control. In Panel B, the lanes are the same as 
indicated for Panel A except lane N is unstransf ected HeLa 
cells as a negative control. 
Fig. 6 shows an autoradiogram an SI nuclease analysis of RNA 

10 from K562 cells transf ected with self -replicating, episomal, 
expression vectors containing a human /S-globin reporter gene 
operatively linked to various combinations of HS components 
from the /9 -globin LCR. Lane M is a marker; lane p is 
globin gene alone; lane 2)9 is )9-globin LCR HS2; lanes 3/9 are 

15 ^-globin LCR HS3 ; lanes 4)8 are )9-globin LCR HS4 ; lanes 22fi 
are )3-globin LCR HS2 and HS3; lanes 34/8 are )3-globin LCR HS3 
and HS4; lanes 24/3 are )8-globin LCR HS2 and HS4; lanes 234)9 
are /9-globin LCR HS2, HS3 and HS4; lanes 234)9i are )9-globin 
LCR HS2, HS3 and HS4 construct as a stable integrant; lanes 

2 0 )9i are )8 -globin gene alone as a stable integrant; lane N is 
untransf ected K562 cells as a negative control; and lane P 
is untransf ected MEL cells stably transf ected with the human 
)9-globin gene as a positive control. All lanes are 
independent transf ect ions. 

25 

Detailed Description of the Invention 

The contents of all references referred to herein are 
incorporated by reference thereto. 
The invention is based on the discovery that a locus 
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control region or component thereof may be used in a stable , 
self-replicating episomal vector containing a gene of 
interest to specify tissue-restricted expression of the gene 
of interest. 

5 The stable maintenance of an episomal vector containing an 
LCR in a host cell is achieved by using an origin of 
replication that is operative to initiate and replicate an 
episomal DNA independent of the host cell chromosomes and an 
origin replication factor. 

10 Origins of Replication and Replication Factors Useful 

According to the Invention 

Plasmid vectors for expressing heterologous genes in 
eukaryotic cells have been made using a portion of the 
genomic DNA of one or more eukaryotic viruses such as 
15 Epstein Barr Virus, human papova virus BK, adenovirus -based 
vectors, and bovine and human papilloma viruses. 

Episomal vectors of the invention comprise a portion of a 
virus genomic DNA that encodes an origin of replication 
(ori) , which is required for such vectors to be self- 
20 replicating and, thus, to persist in a host cell over 
several generations. In addition, an episomal vector of the 
invention also may contain one or more genes encoding viral 
proteins that are required for replication, i.e., 
replicator protein (s) • Optionally, the replicator 
25 protein (s) which help initiate replication may be expressed 
in trans on another DNA molecule, such as on another vector 
or on the host genomic DNA, in the host cell containing a 
self-replicating episomal expression vector of this 
invention. 
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Preferred self-replicating episomal LCR- containing 
expression vectors of the invention do not contain viral 
sequences that are not required for long-term stable 
maintenance in a eukaryotic host cell such as regions of a 
5 viral genome DNA encoding core or capsid proteins that would 
produce infectious viral particles or viral oncogenic 
sequences which may be present in the full-length viral 
genomic DNA molecule. 
The term "stable maintenance" herein, refers to the ability 

10 of a self-replicating episomal expression vector of this 
invention to persist or be maintained in non-dividing cells 
or in progeny cells of dividing cells in the absence of 
continuous selection without a significant loss (e.g. , >50%) 
in copy number of the vector for two and preferably five or 

15 more generations. The most preferred vectors will be 
maintained over 10-15 or more cell generations. In 
contrast, "transient** or "short-term** persistence of a 
plasmid in a host cell refers to the inability of a vector 
to replicate and segregate in a host cell in a stable 

20 manner; that is, the vector will be lost after one or two 
generations, or will undergo a loss of >51% of its copy 
number between successive generations. 

Several representative self-replicating, LCR-containing, 
episomal vectors of this invention are described further 

25 below. The self-replicating function may alternatively be 
provided by one or more mammalian sequences such as 
described by Wohlgemuth et al., 1996, Gene Therapy 3:503; 
Vos et al., 1995, Jour. Cell. Biol., Supp. 21A, 433; and Sun 
et al., 1994, Nature Genetics 8:33, optionally in 



wo 98/07876 



15 



PCT/GB97/02213 



combination with one or more sequence which may be required 
for nuclear retention. The advantage of using mammalian, 
especially human sequences for providing the self- 
replicating function is that no extraneous activation 
5 factors are required which could have toxic or oncogenic 
properties. It will be understood by one of skill in the 
art that the invention is not limited to any one origin of 
replication or any one episomal vector, but encompasses the 
combination of the tissue-restricted control of an LCR in an 
10 episomal vector. 

1. Epstein-Barr Virus-Based Self -Replicating Episomal 
Expression Vectors Useful According to the Invention. 

The latent origin oriP from Epstein-Barr Virus (EBV) is 
described in Yates et. al., Proc. Natl. Acad. Sci. USA 
15 81:3806-3810 (1984); Yates et al.. Nature 313:812-815 
(1985); Krysan et al., Mol . Cell. Biol. 9:1026-1033 (1989); 
James et al. , Gene 86: 233-239 (1990), Peterson and 
Legerski, Gene 107:279-284 (1991); and Pan et al.^ Som. Cell 
Molec. Genet. 18:163-177 (1992)). An EBV-based episomal 
20 vector useful according to the invention will contain the 
oriP region of EBV which is carried on a 2.61 kb fragment of 
EBV (see Fig, 1) and the EBNA-1 gene which is carried on a 
2.18 kb fragment of EBV (See Fig. 1). one vector which 
carries the oriP and EBNA-1 gene of EBV is shown in Fig. l. 
25 The vector shown in Fig. 1 also contains an antibiotic 
resistance gene for selection of stable transfected 
eukaryotic cells in culture, a poly linker cloning site for 
insertion of a gene of interest, and a portion of pBR322 for 
production of vector DNA in bacterial host cells. 
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The EBNA-1 protein, which is the only viral gene product 
required to support in trans episomal replication of vectors 
containing oriP, may be provided on the same episomal 
expression vector containing oriP (see, for example, James 
5 et al., supra; Peterson and Legerski, supra/ Pan et al., 
supra; • it is also understood, that as with any protein 
such as EBNA-1 known to be required to support replication 
of viral plasmid in trans, the gene also may be expressed on 
another DNA molecule, such as a different DNA vector or the 

10 host genomic DNA, in the host cell containing the episomal 
expression vector of the invention. 

2. Papilloma Virus-Based, Self -Replicating, Episomal 
Expression Vectors Useful According to the Invention. 
The episomal expression vectors of the invention also may 

15 be based on replication functions of the papilloma family of 
virus, including but not limited to Bovine Papilloma Virus 
(BPV) and Human Papilloma Viruses (HPVs) • BPV and HPVs 
persist as stably maintained plasmids in mammalian cells. V 
trans-acting factors encoded by BPV and HPVs, namely El and 

20 E2, have also been identified which are necessary and 
sufficient for mediate replication in many cell types via 
minimal origin of replication (Ustav et al., EMBO J. 10: 
449-457 (1991); Ustav et al . , EMBO J. 10:4231-4329, (1991); 
Ustav et al., Proc. Natl* Acad. Sci. USA 90: 898-902 

25 (1993)). 

An episomal vector useful according to the invention is the 
BPV-I vector system described in Piirsoo et al., EMBO J., 
15:1 (1996) and in WO 94/12629. The BPV-1 vector system 
described in Piirsoo et al. comprises a plasmid harboring 
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the BPV-1 origin of replication (minimal origin plus 
extrachromosomal maintenance element) and optionally the El 
and E2 genes. The BPV-1 El and E2 genes are required for 
stable maintenance of a BPV episomal vector. These factors 
5 ensure that the plasmid is replicated to a stable copy 
number of up to thirty copies per cell independent of cell 
cycle status. The gene construct therefore persists stably 
in both dividing and non-dividing cells. This allows the 
maintenance of the gene construct in cells such as 
10 hemopoietic stem cells and more committed precursor cells. 

The BPV origin of replication has been located at the 3' 
end of the upstream regulatory region within a 60 base pair 
(bp) DNA fragment (nucleotides (nt) 7914 - 7927) which 

15 includes binding sites for the El and E2 replication 
factors. The minimal origin of replication of HPV has also 
been characterized and located in the URR fragment (nt 7022- 
7927) of HPV (see, for example, Chiang et al., Proc. Natl. 
Acetd. Sc±. USA 89:5799-5803 (1992)). 

20 As used herein, «E1" refers to the protein encoded by 
nucleotides (nt) 849-2663 of BPV subtype 1 or by nt 832- 
2779 of HPV of subtype 11, to equivalent El proteins of 
other papilloma viruses, or to functional fragments or 
mutants of a papilloma virus El protein, i.e., fragments or 

25 mutants of El which possess the replicating properties of 
El. 

As used herein, «E2" refers to the protein encoded by nt 
2594-3837 of BPV subtype 1 or by nt 2723-3823 of HPV subtype 
11, to equivalent E2 proteins of other papilloma viruses, or 
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to functional fragments or mutants of a papilloma virus E2 
protein, i.e., fragments or mutants of E2 which possess the 
replicating properties of E2. 
"Minichromosomal maintenance element" (MME) refers to the 
5 extrachromosomal maintenance element of the papilloma viral 
genome to which viral or human proteins essential for 
papilloma viral replication bind, which region is essential 
for stable episomal maintenance of the papilloma viral MO in 
a host cell, as described in Piirsoo et al. (supra). 

10 Preferably, the MME is a sequence containing multiple 
binding sites for the transcriptional activator E2. The MME 
in BPV is herein defined as the region of BPV located within 
the upstream regulatory region which includes a minimum of 
about six sequential E2 binding sites, and which gives 

15 optimum stable maintenance with about ten sequential E2 
binding sites. E2 binding site 9 is a preferred sequence 
for this site, as described hereinbelow, wherein the 
sequential sites are separated by a spacer of about 4-10 
nucleotides, and optimally 6 nucleotides. El and E2 can be 

20 provided to the plasmid either in cis or in trans, also as 
described in WO 94/12629 and in Piirsoo et al. (supra). 

*'E2 binding site" refers to the minimum sequence of 
papillomavirus double-stranded DNA to which the E2 protein 
binds. An E2 binding site may include the sequence 5* 

25 ACCGTTGCCGGT 3', which is high affinity E2 binding site 9 of 
the BPV-1 URR; alternatively, an E2 binding site may include 
permutations of binding site 9, which permutations are found 
within the URR, and fall within the generic E2 binding 
sequence 5' ACCN6GGT 3*. One or more transcriptional 
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act:ivat:or E2 binding si1:es are. In most papillomaviruses, 
located in the upstream regulatory region, as in BPV and 
HPV. 

A vector which also is useful according to the invention 
5 may include a region of BPV between 6959 - 7945/1 - 470 on 
the BPV genetic map (as described in WO 94/12629) , which 
region includes an origin of replication, a first promoter 
operatively associated with a gene of interest, the BPV El 
gene operatively associated with a second promoter to drive 

10 transcription of the El gene; and the BPV E2 gene 
operatively associated with a third promoter to drive 
transcription of the E2 gene. 

El and E2 from BPV will replicate vectors containing the 
BPV origin or the origin of many HPV subtypes (Chiang et 

15 al., supra}. El and E2 from HPV will replicate vectors via 
the BPV origin and via the origin of many HPV subtypes 
(Chiang et al., supra). As with all vectors of the 
invention, the BPV-based episomal expression vectors of the 
invention must persist through 2-5 or more divisions of the 

20 host cell. 

3. Papovavirus-Based, Self -Replicating, Episomal 

Expression Vectors Useful According to the Invention. 

The vectors of the invention also may be derived from a 
human papovavirus BK genomic DNA molecule. For example, the 
25 BK viral genome can be digested with restriction enzymes 
EcoRI and BamHI to produce a 5 kilobase (kb) fragment that 
contains the BK viral origin of replication sequences that 
can confer stable maintenance on vectors (see, for example, 
De Benedetti and Rhoads, Nucleic Acids Res. 19:1925 (1991), 
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as can a 3,2 kb fragment of the BK virus (Cooper and Mlron, 
Human Gene Therapy 4:557 (1993)). 
4, Regulation of Origin of Replication Factor Expression. 
As noted above, the vectors of the invention may contain 
5 one or more genes encoding a trans-acting viral replication 
factor required for stable maintenance of the vectors in 
host cells. 

The regulatory elements, e.g., promoters, enhancers, LCRs, 
which drive expression of the replicator gene(s) thus may be 
identical or different, and may provide identical or 
overlapping tissue-specificity as the LCR, such as, for 
example, for a tumor that is expressed in B-cells, the 
immunoglobulin heavy chain promoter/ enhancer for B-cells and 
the Ig heavy or light chain promoters for blood cell 
expression, or for tumors that are present in an unspecified 
cell type, and also may include a promoter from a 
ubiquitously expressed gene, for example from the 
phosphoglycerolkinase, lE-CMV, RSV-LTR or DHFR genes. The 
arrangement of replicator gene(s) relative to the episomal 
vector origin of replication may mimic the natural 
orientations of these sequences in the virus genome, or it 
may assume a variety of other orientations, the choices of 
which will be apparent to one of skill in the art. 

In EBV-based episomal vectors, it is often preferable that 
the transcription of a viral gene encoding a replication 
factor be placed under the control of a relatively weak 
promoter. For example, expression of the gene encoding 
EBNA-1 can be toxic to most cells and/ or result in 
undesirable rearrangements in vector sequences when the gene 
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Is expressed using the CMV promoter, a well known, 
relatively strong, eukaryotic promoter. In contrast, 
expression of the EBNA-1 gene using a relatively weak 
eukaryotic promoter sequence permits EBNA-1 to function to 
5 maintain a desirable level (i.e., a level that is non-toxic 
to cells in culture) of vector replication in host cells. 
Such relatively weak promoter sequences useful in the 
vectors of the invention are found in a portion of the 
sequence of the bacterial plasmid pBR322 and the thymidine 
10 kinase (tk) promoter. 

Preferred promoters for El and E2 expression in 
papillomavirus-based vectors include the thymidine kinase 
promoter, the SV40 early promoter, the CMV promoter and the 
SRa promoter 5 Manufacture of Vector DNA. 
15 In another embodiment of the invention, vectors are made 
comprising culturing a cell containing the vector of the 
present invention in order to produce sufficient DNA for use 
according to the invention. It is particularly preferred 
that such manufacture occurs in lower eukaryotic cells, 
20 e.g., yeast or insect, or prokaryotic cells, e.g., bacterial 
cells such as E. coll or Salmonellst. Therefore, it is 
preferred that a vector of this invention will further 
comprise an origin of replication of yeast, insect or 
bacterial origin, e.g., the pBR322 origin of replication, 
25 and one or more genes encoding a selectable marker, e.g., a 
gene encoding kanamycin resistance, for selection of cells 
containing the vector and/or a marker gene such as Lac Z. 

LCRs. and C omponents Thereof. For Providing Tlssue-Specif ic 
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gen^ Expression 

Useful According to the Invention 

The invention contemplates the combination of a stable, 
self-replicating episomal vector and a locus control region 
5 or component thereof for tissue-type restricted expression 
of a gene contained in the vector. 

Without being bound to any one theory, it is believed that, 
in the genomic DNA context, the LCR promotes an opening of 
the usually tightly wound structure of the chromatin in 

10 cells of a specific tissue type and thereby permits 
transcription of the genes in the region of the opened 
chromatin in a tissue-specific manner. 

In addition, LCRs are capable of conferring tissue- 
restricted gene expression whether they are located upstream 

15 or downstream of a linked gene, and regardless of 
orientation with respect to the gene. LCRs useful in an 
episomal vector according to the invention may be placed 
upstream or downstream or in any orientation with respect 
to the linked gene. 

20 Prior to the invention, it was not known whether an LCR, 
when present in an episomal rather than a chromosomal 
context, could confer tissue-restricted gene expression. 
Furthermore, in transient transf ections of a vector, LCR 
components were found not to increase expression levels 

25 unless the LCR component comprised a classical enhancer 
element (Tuan et al., 1989, Proc. Nat. Aca. Sci 86:2544). 

In the self -replicating, episomal expression vectors 
described herein, an LCR, or component thereof, is 
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operatively linked to a selected gene present on the vectors 
such that the gene expression occurs only in cells of the 
particular tissue type in which the LCR, or component 
thereof, is known to function* While not wishing to be 
5 limited by a mechanism, it would appear that the self- 
replicating vectors described herein are stably maintained 
in a conformation similar to that of the host cell's natural 
chromatin which the LCR, or component thereof, may be 
capable of opening and/ or then promoting the transcription 
10 of sequences to which the LCR, or component thereof, is 
linked* 

In general, whether an LCR or component, or combination of 
components thereof can be used to mediate tissue specific 
expression of a foreign gene is easily assayed using a self- 

15 replicating episomal vector containing the LCR or component 
thereof operatively linked to a reporter gene* Expression 
of the reporter gene in a tissue-specific manner can be 
assayed using standard methods and by comparing reporter 
gene expression in cells of the tissue type for which the 

20 LCR is specific versus reporter gene expression in cells of 
a different tissue type. A variety of reporter genes and 
vectors containing reporter genes are commercially available 
which can be used for testing promoter and /or enhancer 
functions. Standard reporter genes used in the art include 
25 the lacZ gene encoding /?-galactosidase, the CAT gene 
encoding chloramphenicol acetyl transferase, the lucif erase 
gene system, and the gene encoding alkaline phosphatase 
(see, for example, James et al., (supra); Peterson and 
Legerski, (supra); Pan et al., (supra)). 
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For example, the ^-globin LCR contains well-defined 
portions which are DNA fragments encompassing DNase J 
hypersensitive sites, i.e., HSl, HS2, HS3 , and HS4, located 
upstream of the fetal globin genes in the )3-globin locus; 
5 such sites and their detection is described in Tuan et al., 
supra, 1985) • It has been discovered that, when a 
combination of HS2 , HS3, and HS4 components is used in an 
episomal vector according to the invention, the combination 
of liCR components containing HS2 does not confer tissue- 

10 specific expression of a foreign gene in erythroid cells; 
similarly, a combination of HS2 and HS3 does not confer 
tissue-specific expression. However, it also has been 
discovered the HS3 component alone is effective in 
conferring tissue specific expression of a linked gene as 

15 demonstrated in the Examples below; however, the HS2 
component, which is known to be an enhancer, does not 
possess this activity when present as the sole LCR component 
on the episomal vector. It is contemplated according to the 
invention that the absence of HS2 in the episomal )3-globin 

20 LCR results in tissue-specific gene expression. 

Additional LCRs useful according to the invention include 
but are not limited to the CD2 LCR which promotes gene 
expression in T cells (see, for example. Greaves et al., 
(supra)) the macrophage-specif ic lysozyme LCR (Bonifer et 

25 al., (supra)), and a class II MHC LCR (Carson et al., 
(supra) ) • 

One of skill in the art may, using the benefit of this 
disclosure, which identifies a )3-globin LCR component that 
confers tissue-specific gene expression when carried on an 
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episome and methods for finding other such LCRs or their 
components, and techniques available in the art, can 
identify additional LCRs, or a component thereof, which when 
carried on an episomal vector confers tissue-specific gene 
5 expression. 

To identify a component of an LCR that is useful according 
to the invention (i.e., in an episomal context) one of skill 
in the art may test each LCR or component thereof (i.e., 
DNase I hypersensitive site of an LCR) individually by 
10 transfecting an episomal vector containing the candidate 
LCR component and a marker gene into a cell line in which 
the LCR is active (i.e. active when integrated into the 
genome (specific cell line) ) and also into a cell line in 
which the LCR is known to be inactive (i.e. inactive when 
15 integrated into the genome (non-specific cell line) ) . If 
following selection for stable episomes the LCR component is 
active in the non-specific cell line, i.e. indicating that 
the LCR component is not tissue specific (such as HS2 of the 
i9-globin LCR) , then the LCR component is excluded as a 
20 candidate LCR component useful according to the invention. 
An LCR component which is not active in the non-specific 
cell line but is active in the specific cell line is useful 
according to the invention in that it confers tissue 
specificity in the context of a stable episome. LCR 
25 components which are indicated as being useful according to 
the invention may then be combined and used together to give 
potentially enhanced levels of tissue specific expression. 

Optionally, one or more transcription terminators may also 
be incorporated into the vectors of the invention to prevent 
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undesirable transcription from other known or latent 
promoter sequences in the vector which might proceed through 
(i.e., transcriptional readthrough) the gene whose 
expression has been placed under the exclusive control of 
5 a particular LCR, or component thereof. An example of such 
a useful transcription terminator is the ^-globin 
terminator, though other eukaryotic transcription 
terminators also may be used in the vectors described 
herein. Preferably, the transcription terminator is placed 
10 between the LCR, or component thereof, and the promoter of 
the gene to which the LCR is operatively linked. 

Selecting Heterologous or Foreign Genes of Interest 
The vectors of the invention are particularly useful in 

15 expressing heterologous or foreign genes in cells of a 
specific tissue-type. 

Examples of therapeutic nucleic acid sequences which may 
be incorporated into an episomal vector include the 
following. Therapeutically useful nucleic acid sequences 

20 include sequences encoding receptors, enzymes, ligands, 
regulatory factors, and structural proteins. Therapeutic 
nucleic acid sequences also include sequences encoding 
nuclear proteins, cytoplasmic proteins, mitochondrial 
proteins, secreted proteins, plasmalemma-associated 

25 proteins, serum proteins, viral antigens, bacterial 
antigens, protozoal antigens and parasitic antigens. 
Nucleic acid sequences useful according to the invention 
also include sequences encoding proteins, lipoproteins, 
glycoproteins, phosphoproteins and nucleic acid (e.g. , RNAs 
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or antisense nucleic acids) . Proteins or polypeptides which 
can be expressed using the episomal vector of the present 
invention include hormones, growth factors, enzymes, 
clotting factors, apolipoproteins, receptors, drugs, 
5 oncogenes, tumor antigens, tumor suppressors, viral 
antigens, parasitic antigens and bacterial antigens. 
Specific examples of these compounds include proinsulin, 
growth hormone, androgen receptors, insulin- like growth 
factor I, insulin-like growth factor II, insulin-like growth 
10 factor binding proteins, epidermal growth factor TGF-a, TGF- 
fi, PDGF, angiogenesis factors (acidic fibroblast growth 
factor, basic fibroblast growth factor and angiogenin) , 
matrix proteins (Type IV collagen. Type VII collagen, 
laminin) , phenylalanine hydroxylase, tyrosine hydroxylase, 
15 oncogenes (ras, fos, myc, erb, src, sis, jun) , E6 or E7 
transforming sequence, p53 protein, Rb gene product, 
cytokine receptor, Il-l, IL-6, IL-8, viral capsid protein, 
and proteins from viral, bacterial and parasitic organisms 
which can be used to induce an immunologic response, and 
20 other proteins of useful significance in the body. The gene 
which can be incorporated is only limited by the 
availability of the nucleic acid sequence encoding the 
protein or polypeptide to be incorporated. One skilled in 
the art will readily recognize that as more proteins and 
25 polypeptides become identified they can be integrated into 
the episomal vector and expressed in the animal or human 
tissue. 

Vectors of the invention also may be used to express genes 
that are already expressed in a host cell (i.e., a native or 
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homologous gene) , for example, to increase the dosage of the 
gene product. It should be noted, however, that expression 
of a homologous gene may result in deregulated expression 
which may not be subject to control by the LCR, or component 
5 thereof, due to its over-expression in the cell. 

Vector Delivery 

Numerous techniques are known and useful according to the 
invention for delivering self -replicating, LCR (or component 

10 thereof ) -containing, episomal expression vectors described 
herein cells including the use of nucleic acid condensing 
agents, electroporation, complexation with asbestos, 
polybrene, DEAE cellulose, Dextran, liposomes, 
lipopolyamines, polyornithine, particle bombardment and 

15 direct microinjection (reviewed by Kucherlapati and 
Skoultchi, Crit. Rov. Biochom. 16:349-379 (1984); Keown et 
al.. Methods Enzymol. 185:527 (1990)). 

A vector of the invention may be delivered to a host cell 
non-specif ically or specifically (i.e, , to a designated 

20 subset of host cells) via a viral or non-viral means of 
delivery. Preferred delivery methods of viral origin 
include viral particle-producing packaging cell lines as 
transfection recipients for the vector of the present 
invention into which viral packaging signals have been 

25 engineered, such as those of adenovirus, herpes viruses and 
papovaviruses . Preferred non-viral based gene delivery 
means and methods may also be used in the invention and 
include nucleic acid condensing peptides, encapsulation in 
liposomes, and transfection of cells ex vivo with subsequent 
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reimplantation or administration of the transfected cells. 

Various peptides derived from the amino acid sequences of 
viral envelope proteins have been used in gene transfer when 
co-administered with polylysine DNA complexes (Plank et al., 
5 J. Biol. Chem. 269:12918-12924 (1994)); Trubetskoy et al., 
BioconjugatB Chem. 3:323-327 (1992); WO 91/17773; WO 
92/19287; and Mack et al.. Am. J. Med. Sci. 307:138-143 
(1994)) suggest that co-condensation of polylysine 
conjugates with cat ionic lipids can lead to improvement in 
10 gene transfer efficiency. PCT publication number WO 
95/02698 discloses the use of viral components to attempt to 
increase the efficiency of cationic lipid gene transfer. 

Nucleic acid condensing agents useful in the invention 
include spermine, spermine derivatives, histones, cationic 
15 peptides and polylysine. Spermine derivatives refers to 
analogues and derivatives of spermine and include compounds 
as set forth in International Publication No. WO 93/18759 
(published September 30, 1993). 
Disulfide bonds have been used to link the peptidic 
20 components of a delivery vehicle (Gotten et al.. Moth. 
Enzymol. 217:618^644 (1992)); see also, Trubetskoy et al. 
(supra) . 

Delivery vehicles for delivery of DNA constructs to cells 
are known in the art and include DNA/poly-cation complexes 
25 which are specific for a cell surface receptor, as described 
in, for example, Wu and Wu, J. Biol. Chem. 263:14621 
(1988); Wilson et al., J. Biol. Chem. 267:963-967 (1992); 
and U.S. Patent No. 5,166,320). 
Delivery of a vector according to the invention is 
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contemplated using nucleic acid condensing peptides. A 
nucleic acid condensing peptide which is particularly useful 
for condensing the nucleic acid construct and therefore for 
delivering nucleic acid to a cell includes an amino acid 
5 sequence of the generic formula 

NH2-A-(X^X2YiY2)„X3X^-(Z,Z2Z3ZJ - (X^X^YjY J ^X^XaBCOOH 
wherein each of X^.g is, independently, an amino acid having 
a positively charged group on the side chain; wherein each 
of Y^-^ is, independently, a naturally occurring amino acid 

10 which promotes alpha helix formation; wherein each of Z^.^ 
is, independently, a naturally occurring amino acid with at 
least 3 amino acids having a high propensity to form a 
stabilized turn structure; wherein A is an amino-terminal 
serine or threonine residue; wherein B is any amino acid; 

15 and wherein n 2 - 4 and m = 2. 

Other peptides are those wherein each of X^_g is, 
independently, lysine, arginine, 2 .4-diamino-butyric acid or 
ornithine; wherein each of Y,-^ is, independently, glutamic 
acid, alanine, leucine, methionine, glutamine, tryptophan or 

20 histidine; wherein each of Z,.^ is, independently, 
asparagine, glycine, proline, serine, and aspartic acid; 
wherein B is any one of alanine, glutamic acid or cysteine. 

It is also contemplated according to the invention that 
peptides useful in this embodiment of the invention which 

25 involves delivery of a nucleic acid to a cell either bx 
vivo or in vivo may contain one or more internal Serine, 
Threonine, or Cysteine residues, preferably at a position in 
the sequence which will be exposed for conjugation to a 
selected ligand, and thus not on the positively charged 
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(nucleic acid oriented) face of the a-helix. This 
positioning of selected reactive amino acid residues within 
the peptide are oriented such that they do not contact the 
face of the peptide that contacts nucleic acid permits 
5 conjugation of the peptide with other functional peptides by 
bonds of selected and defined stability. Cysteine allows 
specific conjugation via the thiol side chain to compounds 
containing other reactive thiol groups (via disulfides) , 
alkylating functions (to form thioether bonds) , or other 
10 thiol reactive groups such as maleimide derivatives. 

Peptides which fall within this generic sequence include: 
NBC7 TRI^lOT«AKIWAARRCGVSARRAARRAWRRE-OH; and 
NBCll H-TKKAWKKAEKKAAKKCGVSAKKAAKKAWKKA-NHo 
Thus, a nucleic acid condensing peptide useful for delivery 
15 of a nucleic acid may contain: 1) helix-forming amino acids, 
2) a repeating three-dimensional structure that contacts the 
major groove of the nucleic acid, 3) suitable chromophores 
for quantitation, and 4) a number of "handles" (i.e., 
reactive sites) for regio-specif ic conjugation of ligands 
20 which form accessory functional domains. 

Nucleic acid condensing peptides also may include portions 
of HI (sequence I, II or III below) or sequences from other 
human histones (sequence IV below) which are identified 
herein as sequences which possess the ability to condense 
25 nucleic acid. Therefore, a nucleic acid condensing peptide 
can include a linear combination of the following three 
consensus sequences where the total sequence length is >17 
residues: 

Sequence I: -K-K-X-P-K-K-Y-Z-B-P-A-J- 
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Where: K is Lysine, P is Proline; A is Alanine; X is Serine, 
Threonine or Proline; Y is Alanine or Valine; Z is Alanine, 
Threonine or Proline; B is Lysine, Alanine, Threonine or 
Valine; and J is Alanine or Valine. 
5 Sequence II: -X-K-S-P-A-K-A-K-A- 

vhere: X is Alanine or Valine; K is Lysine; S is Serine; P 

is Proline; and A is Alanine. 

Sequence III: -X-Y-V-K-P«-K-A-A-K-Z-K-B- 

where: X is Lysine or Arginine; Y is Alanine or Threonine; 
10 Z is Proline, Alanine or Serine; B is Lysine, Threonine or 
Valine; K is Lysine; P is Proline; A is Alanine. 
Sequence IV 

-A-B-C-D-E-F-G-H-I-J-K- 
Where: A is preferably Lysine or Threonine; B is preferably 

15 Glycine or Glutamine; C is preferably Glycine, but can also 
be Aspartate, Glutamate, or Serine; D is preferably Glycine, 
but can also be Lysine, Valine, Glutamine, or Threonine; E 
is preferably Lysine or Alanine; F is preferably Alanine or 
Lysine, G is preferably Arginine, but can also be Valine or 

20 Isoleucine; H is preferably Alanine, but can also be 
Threonine, Histidine, or Proline; I is preferably Lysine, 
Arginine, or Glutamine; J is Alanine or Anginine; and K is 
preferably Lysine or Glutamine. A preferred consensus 
sequence is : 

2 5 -K-G-G-G-K-A-R-A-K-A-K- . 

One such peptide is NBCl, which has the following 
structure : 

NH2-[SV40 NLS]-[Seq I]-[Seq IlJ-CSeq III]-[SV40 NLS]-[Seq 
I]-C-COOH, where -C- is Cysteine; where the SV40 NLS has the 
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sequence Pro-Lys-Lys-Lys-Arg-Lys-Val-Gln; and the sequence 
H-PKKKMCVEKKSPKKAIOCPAAKSPAKAKAKAVKPKAAKPKKP^ 
KKAKKPAAC ( Acm) -OH . 
Another such nucleic acid condensing peptide of the 
5 invention will have an amino acid sequence that falls within 
the following generic sequence: 

NH2-X-(Y)^-C-COOH, where X is either absent or Serine or 
Threonine; Y is sequence I, II or III as defined above; n is 
2-6; and C is Cysteine ♦ 
10 Other such peptides have the following structures and 
sequences: NBC2 has the structure: NHg-CSeq III]-[SV40 
NLSl]-[Seq I]-C-COOH, where -C- is Cysteine. 

NBC8 has the structure: NHg-CSeq I3-[Seq I3-C-COOH, where - 
C- is Cysteine. 

15 NBC9 has the structure: NHg-tSeq I]-[Seq I]-[Seq I]-C- 
COOH, where -C- is Cysteine. 

NBCIO has the structure: NHg-CSeq I]-[Seq I]-[Seq I]-[Seq 
I]-C-COOH 

Where -C- is Cysteine; the amino acid sequences of which are 
20 as follows: 

NBC2 H-KAVKPKAAKPKKPKKKRKVEKKSPKKAKKPAAC (Acm) -OH ; 

NBCB H-KKSPKKAKKPAAKKSPKKAKKPAAC ( Acm) -OH ; 

NBC9 H-KKSPKKAKKPAAKKSPKKAKKPAAKKSPKKAKKPAAC (Acm) -OH ; 

NBCIO H-KKSPKKAKKPAAKKSPKKAKKPAAKKSPKKAKKPAAKKSPKKAKKP 
25 AAC(Acm)-OH. 

As described above, nucleic acid condensing peptides having 
a low polydispersion index (PDI) are useful for delivery to 
a cell of a nucleic acid according to the invention. The 
PDI for such peptides may be calculated from analysis of 
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the peptides by electro-spray mass spectrometry. This 
method gives the exact mass of each component to within 
0«001%. The PDI values of the peptide preparations useful 
in the present invention are in the range of i.o • 1.100. 
5 Peptide preparations which are especially useful in the 
invention possess a PDI <1.01, and even <1.001. 

The first nucleic acid condensing peptide may include 8- 
24 positively charged amino acid side groups; for example, 
the nimber of positively charged amino acid side groups may 

10 be in the range of 12-18. 

The ratio of positive/negative charges in a synthetic virus 
like particle that is capable of targeting a specific 
mammalian cell type is within the range 0.5-3 per phosphate 
residue in the nucleic acid; this ratio thus also may be 

15 within the range 0.8 - 1.2. 

The ratio of positive/negative charges in a synthetic virus 
like particle that is unrestricted with respect to the type 
of cell it targets is in within the range of 0.5 - 5 per 
phosphate residue in the nucleic acid, and thus also may be 

20 within the range of 1.2 - 2. 

Functional groups may be bound to peptides useful for 
delivery of a vector according to the invention. These 
functional groups may include a ligand that targets a 
specific cell-type such as a monoclonal antibody, insulin, 

25 transferrin, asialoglycoprotein, or a sugar. The ligand 
thus may target cells in a non-specific manner or in a 
specific manner that is restricted with respect to cell 
type. 

The functional group also may comprise a lipid, such as 
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palmitoyl, oleyl, or stearoyl; a neutral hydrophilic polymer 
such as polyethylene glycol (PEG) , or polyvinylpyrrolidine 
(PVP) ; a fusogenic peptide such as the HA peptide of 
influenza virus; or a recombinase or an integrase. The 
5 functional group also may comprise an intracellular 
trafficking protein such as a nuclear localization sequence 
(NLS) . 

In a particularly preferred embodiment for delivery, that 
is, wherein the second functional group is covalently linked 
10 to a first functional group which is linked directly to the 
nucleic acid condensing peptide, the first functional group 
may comprise one of a lipid or a neutral hydrophilic polymer 
such as PEG and the second functional group may comprise a 
ligand that targets a cell surface receptor. For example, 
15 when the first functional group comprises a lipid, the 
second functional group may comprise a ligand that targets 
a cellular receptor. When the first functional group 
comprises PEG, the second functional group may comprise a 
ligand that targets a cellular receptor. The ligand may be, 
20 for example, one of a sugar moiety or a ligand whose 
cellular receptor is restricted to a cell-type, and thus the 
target cell population may be unrestricted or restricted as 
to cell type. Alternatively, when the first functional 
group comprises a lipid, the second functional group may 
25 comprise PEG. 

The above-described nucleic acid condensing peptide /vector 
DNA composition may be prepared as follows. The composition 
is formulated such that the nucleic acid and the peptide 
preparation are prepared in equal volumes of the same buffer 
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(usually 0.15M to l.OM NaCl; 25 mM HEPES, pH 7.4). The 
nucleic acid Is shaken or vortexed while the condensing 
peptide preparation Is added at the rate of O.l volume per 
minute. The complex Is left at room temperature for at least 
5 30 minutes prior to addition to the target cells or prior to 
administration to a subject, and can be stored at 4**C. The 
particle Is centrlfuged to remove any aggregated material. 

In addition to the above-described DNA/polycation complexes 
for cell targeting, methods are known in the prior art for 

10 preparing cell-targeting liposomes containing nucleic acid. 
Liposomes are hollow spherical vesicles composed of lipids 
arranged in a similar fashion as those lipids which make up 
the cell membrane. They have an internal aqueous space for 
entrapping water soluble compounds and range in size from 

15 0.05 to several microns in diameter. Several studies have 
shown that liposomes can deliver nucleic acids to cells and 
that the nucleic acid remains biologically active. 

For example, a liposome delivery vehicle originally 
designed as a research tool, Lipofectin, has been shown to 

20 deliver intact mRNA molecules to cells yielding production 
of the corresponding protein. 

Liposomes offer several advantages: They are non-toxic and 
biodegradable in composition; they display long circulation 
half -lives; and recognition molecules can be readily 

25 attached to their surface for targeting to tissues. 
Finally, cost effective manufacture of liposome-based 
pharmaceuticals, either in a liquid suspension or 
lyophilized product, has demonstrated the viability of this 
technology as an acceptable drug delivery system. An 
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example of targeting liposomes is immuno liposomes. 
Liposomes are prepared, for example, by adsorption of 
proteins (e.g., immunoglobulin) on the liposomal surface; 
incorporation of native protein into the liposome membrane 
5 during its formation (e.g., by ultrasonication, detergent 
dialysis or reverse phase evaporation) ; covalent binding 
(direct or via a spacer group) of a protein to reactive 
compounds incorporated into the liposomes membrane; 
noncovalent hydrophobic binding of modified proteins during 
10 liposome formation or by the incubation with preformed 
liposomes) ; and indirect binding, including covalent binding 
of immunoglobulin protein via a polymer to the liposome (see 
Torchilin, V.P. CRC Critical Reviews in Therapeutic Drug 
Carrier Systems, 2(1)). Binding of the nucleic acid-ligand 
15 complex to the receptor facilitates uptake of the nucleic 
acid by receptor-mediated endocytosis. 

A nucleic acid-ligand complex linked to adenovirus capsids, 
which naturally disrupt endosomes, thereby releasing 
Jnaterial into the cytoplasm, can be used to avoid 
20 degradation of the complex by intracellular lysosomes (see 
for example Curiel et al., Proc. Natl. Acad. Sci. USA 
88:8850 (1991); Cristiano et al., Proc. Natl. Acad. Sci. USA 
90:2122-2126 (1993) ) . Receptor-mediated nucleic acid uptake 
can be used to introduce nucleic acid into cells either in 
25 vitro or in vivo and, additionally, has the added feature 
that nucleic acid can be selectively targeted to a 
particular cell type by use of a ligand which binds to a 
receptor selectively expressed on a target cell of interest, 
or can be non-selective with respect to the target cell 
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type. 

The precise stoichiometric ratio of the various components 
of the delivery vehicle can be varied in order to control 
the magnitude of the initial immune response, the efficiency 
5 of delivery and the degree of specific targeting to cells. 

In the case of non-specific delivery to cells, a non- 
specific ligand may be used that targets a cell surface 
receptor; in the case of specific delivery, a ligand may be 

10 used that targets a specific subset of cells. For example, 
soluble DNA/ poly lysine complexes can be generated (Li et 
al., Biochejn. J. 12:1763 (1973)) • Polylysine complexes 
tagged with asialoglycoprotein have been used to target DNA 
to hepatocytes In vitro (Wu and Wu, J. Biol. Chem. 262:4429 

15 (1987); U.S. Patent 5,166,320). Lactosylated polylysine 
(Hidoux et al^. Nucleic Acids Res, 21:871-878 (1993)) and 
galactosylated histones (Chen et al.. Human Gene Therctpy 
5:429-435 (1994)) have been used to target plasmid DNA to 
cells bearing lectin receptors, and insulin conjugated to 

20 polylysine (Rosenkrantz et al., Exp. Cell Res. 199:323-329 
(1992)) to cells bearing insulin receptors. Monoclonal 
antibodies have been used to target DNA to particular cell 
types (Machy et al., Proc. Natl. Acad. Scl. USA 85:8027- 
8031 (1988); Trubetskov et al., supra and WO 91/17773 and WO 

25 92/19287) . 

In a preferred embodiment, a ligand is included in the 
delivery vehicle for targeting a vector of the invention to 
cells of a specific type of tissue. Preferably, the ligand 
is specific for an epitope expressed on the surface of cells 
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of a particular type of tissue* For certain cancer tissue, 
preferred ligands include antibodies or fragments thereof, 
such as monoclonal antibody C242 which recognizes the CA242 
domain of CanAg expressed on the colon cancers (Lindholm et 
5 al.. Int. Arch. Allergy Appl. Immunol. 71:171-181 (1983) and 
Larson et al., Jnt. J. Canc&r 42:877-882 (1983)); monoclonal 
antibody SM3, which recognizes polymorphic epythelial mucin 
(PEM) expressed by breast cancer cells (Burchell et al., 
Cancer Research 47:5476-5482 (1983)) and monoclonal 
10 antibodies recognizing a novel epitope found on the 
epidermal growth factor receptor of human glial tumors but 
not on normal tissues (Moscatello et al., Cancer Res. 
55:5536-5539 (1995)). Furthermore, growth factors can also 
be used to target tumors. For example, the epidermal growth 
15 factor (EGF) receptor is over expressed on lung cancer cells 
and thus EGF can be used as the ligand to target the 
delivery vehicle to lung cancers (Cristiano et al.. Cancer 
Gene Therapy 3:4-10 (1996)). 
In addition, certain soluble macromolecules can be used for 
20 passive tumor targeting of certain tumor types. Many solid 
tumors possess vasculature that is hyperpermeable to 
macromolecules. Although the reasons for this are not 
clearly understood, the result is that such tumors can 
selectively accumulate circulating macromolecules. The 
25 enhanced permeability and retention effect (EPR effect) is 
thought to constitute the mechanism of action of SMANCS 
(sytrene/maleic-anhydride-neocarzinostatin) , now in regular 
clinical use in Japan for the treatment of hepatoma. 
Another class of conjugates under investigation for 
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anticancer activity is N-(2-hydroxypropyl) methacrylamide 
copolymer-anthracycline conjugates (Seymour, L.^ Critiaal 
Reviews In Therapeutic Drug Carrier Systems 9(2): 135-187 
(1992)). Thus, a polymer comprising styrene/maleic- 
5 anhydride or N- ( 2 -hydroxy-propyl ) methacrylamide copolymer 
can be used as a ligand to target the delivery vehicle of 
the present invention. 



10 Pharmaceutical Compositions and Therapeutic Use 

The pharmaceutical compositions of the present invention 
may comprise a self -replicating episomal vector or delivery 
composition of the present invention, if desired, in 
admixture with a pharmaceutically acceptable carrier or 

15 diluent, for therapy to treat a disease or provide the cells 
of a particular tissue with an advantageous protein or 
function. 

A self -replicating episomal, LCR-containing vector of the 
invention or composition or delivery vehicle comprising a 

20 vector of the invention may be administered via a route 
which includes intramuscular, intravenous, aerosol, oral 
(tablet or pill form), topical, systemic, ocular, as a 
suppository, intraperitoneal and/or intrathecal and direct 
injection of the vector DNA or delivery vehicle into a tumor 

25 mass. 

The exact dosage regime will, of course, need to be 
determined by individual clinicians for individual patients 
and this, in turn, will be controlled by the exact nature 
of the protein expressed by the gene of interest, and the 
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type of tissue that is being targeted for treatment • 

The dosage also will depend upon the disease indication and 
the route of administration. Advantageously, the vectors of 
the invention are designed to provide a long-term expression 
5 of a selected gene only in the cells of a specific tissue, 
i.e., in the host cells of tissue in which the LCR, or 
component thereof, is functional. Thus, in such host cells, 
duration of treatment will generally be continuous or until 
the cells die. The number of doses will depend upon the 
10 disease, and efficacy data from clinical trials. 

The amount of vector DNA delivered for effective gene 
therapy according to the invention will be in the range of 
between about 50 ng -1000 fxg of vector DNA/kg body weight; 
and preferably in the range of between about 1-100 /xg vector 
15 DNA/kg. 

Although it is preferred according to the invention to 
administer the episomal vector to a mammal for in vivo cell 
uptake, a vector as described herein may be administered 
utilizing an ex vivo approach whereby cells are removed from 

20 an animal, transduced with the episomal vector containing a 
selected gene under the control of a particular LCR, or 
component thereof, and then re-implanted into the animal. 
The liver, for example, can be accessed by an bx vivo 
approach by removing hepatocytes from an animal, transducing 

25 the hepatocytes in vitro with the episomal vector containing 
a gene encoding a selected protein or therapeutic nucleic 
acid, and re-implanting the transduced hepatocytes into the 
animal (e.g, , as described for rabbits by Chowdhury et al.. 
Science 254:1802-1805, 1991, or in humans by Wilson, Hum. 
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Gene Ther. 3:179-222, 1992). Such methods also may be 
effective for delivering an expression vector of the 
invention to various populations of cells in the circulatory 
or lymphatic systems, such as erythrocytes, T cells, and B 
5 cells. 



Transgenic Animals For Testing Tissue Soecificitv or 
Efficacy of Gene Therapy 
In another embodiment of the invention, there is provided 

10 a mammalian model for determining the tissue-specificity 
and/or efficacy of gene therapy using a self-replicating 
episomal LCR-containing vector of the invention. The 
mammalian model comprises a transgenic animal whose cells 
contain the vector of the present invention. Methods of 

15 making transgenic mice (Gordon et al., Proc. Natl. Acad. 
Sci. USA 77:7380 (1980); Harbers et al.. Nature 293:540 
(1981); Wagner et al., Proc. Natl. Acad. Sci. USA 78:5016 
(1981); and Wagner et al., Proc. Natl. Acad. Sci. USA 
78:6376 (1981), sheep, pigs, chickens (see Hammer et al., 

20 Nature 315:680 (1985)), etc., are well-known in the art and 
are contemplated for use according to the invention. Such 
animals permit testing prior to clinical trials in humans. 

For example, the transgenic animal may have a tumor or the 
propensity to develop a tumor. Stewart et al., Jnt. J. 

25 Cancer 53:1023 (1993)) describe a transgenic mouse having a 
T-cell tumor. Teitz et al., Proc. Natl. Acad. Sci. USA 
90:2910 (1993)) describe transgenic mice having 
rhabdomyosarcomas and insulin-producing pancreatic-islet 
tumors. An episomal vector containing a selected gene to be 
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expressed in the tumor cells alone may be transduced into 
ova from any of these tumor*-bearing mice to create an animal 
model for determining whether the LCR, or component thereof, 
strictly limits expression of the selected gene to tumor 
5 cells and also to determine the result of expressing the 
selected gene in such tumor cells. 

Transgenic animals containing self -replicating, LCR- 
containing episomal vectors of the invention also may be 
used for long-term production of a protein of interest in a 

10 tissue-specific manner; for example, for production of human 
globin in red cells of the transgenic animal using the 
globin LCR or the HS 3 and 4 combination or HS3 alone, or 
production of a protein of interest in milk of a transgenic 
animal using the LCR macrophage/ lysozyme LCR. 

15 The following examples are offered by way of illustration 
and are not intended to limit the invention in any manner. 
The preparation, testing and analysis of several 
representative constructs of the invention is described in 
detail below. One of skill in the art may adapt these 

20 procedures for preparation and testing of other vectors of 
the invention using different episomal vectors, different 
LCRs, or components thereof, and different genes of 
interest. 

EXAMPLE 1 

25 Preparation of Test Gene Constructs and Vector Systems 
1. EBV Vector 

One EBV-based vector useful according to the invention is 
the P220.2 vector (Fig. 1). The p220.2 vector contains (i) 
the Grip and EBNA-l gene from EBV, (ii) a hygromycin 
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resistance gene for selection of stable transfected 
eukaryotic tissue culture cells, (iii) a polylinker cloning 
site for test genes and (iv) a pBR322 vector backbone which 
contains a weak eukaryotic promoter sequence which drives 
5 expression of the EBNA-1 gene in eukaryotic cells. 

2. Reporter Genes 

The reporter genes (Fig. 2) used to assess LCR activity was 
derived from human )9-globin: 4 . 1 kb Hpal-EcoRV fragment with 
815 base pairs (bp) 5' and 1685 bp 3* flanking sequences 
10 (Fritsch et al.. Cell 19:959-972 (1980); Lawn et al.. Cell 
19:959-972 (1980) 

The reporter gene was separately cloned by blunt-end 
ligation into the unique polylinker Hindlll site of p220.2 
(Fig. 2) . 

15 3 . fl-alobin LCR Test Constructs 

Three i9-globin LCR components are the DNase 1 
hypersensitive sites HS2, HS3 and HS4 (Collis et al.^ EMBO 
J. 9:223-240 (1990)). These components were cloned as 1.5 - 
2 kb fragments individually or in combinations of two or 
20 three sites (see below) upstream of the /9-globin or 7H2K 
reporter genes in the Sail site of p220.2 (see. Fig. 3) . 

EXAMPLE 2 

Analysis of Stable Transfection of Tissue Culture Cells 
25 EBV-based vectors are most stable as self-replicating 
episomes in primate or human cells. Therefore stable 
transf ections of the ^-globin LCR test gene constructs were 
carried out in the human myelogenous leukemia cell line K562 
(Lozzio and Lozzio, Blood 45:321-334 (1975)). This cell 
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line displays an embryonic (c) and foetal (7) pattern of 
globin gene expression (Anderson et al., Int. J. C^ncor 
23:143-147 (1979)) and has previously been successfully used 
to assess ;3-globin LCR function (Blom van Assendelft et al., 
5 Cell 56:969-977 (1989)), 
1. Transfection Method 

K562 cells (10^) were transfected by electroporation with 
50 tig of supercoiled test construct DNA using a Bio Rad Gene 
Pulser (BioRad, Hercules, CA) set at 960 fiF and 3 00 V as 
10 described in Antoniou, 1991. Each electroporated sample was 
divided equally between two 75 cm^ tissue culture flasks in 
order to generate two independent pools of stable 
transfected cells. Hygromycin B (250 /xg/ml) was added 24 
hours after electroporation and maintained at this 
15 concentration thereafter. Cell death of non-transf ected 
cells was evident after 7-10 days (equivalent to about 10 
cell divisions) and confluent flasks of transfected cells 
were obtained 7 days later. 

2. RNA a nd DNA Preparation from Transfected Cells 
20 (i) Total RNA was prepared by selective precipitation in the 
presence of 3 M LiCl and 6 M urea (Auffrey and Rougeon, Eur. 
J. BiochBm. 107:303-314 (1980)). 

(ii) Total genomic DNA was prepared from isolated nuclei. 
Cells (-2 X 10^) were disrupted in RSB (10 mM Tris-HCl pH 
25 7.5, 10 mM NaCl, 3 mM MgClg) plus 0.1% NP4 0 followed by 
centrifugation to pellet the nuclei. The nuclei were 
subsequently lysed by resuspending in TNE (150mM Tris-HCl, 
pH 7.5, lOOmM NaCl, 5mM EDTA) , 1% SDS and 50 fig/ml 
proteinase K and incubated overnight at 55 °C. This was 
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followed by one extraction with an equal volume of 
phenol /chloroform (1:1) and the DNA precipitated from the 
aqueous phase with 0.7 volumes of isopropanol. The 
resulting DNA precipitate was spooled out of the solution, 
5 washed once in 70% ethanol and dissolved in 200 /il of TE (10 
mM Tris-Cl. 0.1 mM EDTA, pH 8.0). 

3. Analysis for Expression of Test Genes 

Transfected gene expression was assessed by an Sl-nuclease 
protection assay (Antoniou et al.. Human Genetic Disease 

10 Analysis. A Practical Approach (second edition) (1993) 
using an end labeled DNA probe from the 5 • - region of the 
globin (Antoniou et al., (1988) or 7H2K genes. These probes 
detect only those transcripts which have arisen by 
initiation from the correct, wild-type cap site (Fig. 4 and 

15 5, lower panel) . As an internal standard, all samples were 
simultaneously assayed for the endogenous 7-globin mRNA 
using a probe from the 3'-end of the gene (Fig. 4, lower 
panel) . 

4. Assessing Eoisomal Status and Copv Number of Transfected 
20 DNA 

(a) Episomal Status. 

Total DNA (10 /ixg) from isolated nuclei was digested with 
Pvul and Dpnl (6 units) overnight and separated by 
electrophoresis on a 0.6% agarose gel and Southern blotted 
25 onto Magnacharge Plus nylon transfer membranes. Dpnl 
digestion ensures that residual bacterial derived plasmid 
DNA is destroyed. The blot was then probed with the p220.2 
vector labeled with ^^P by nick-translation. Transfected DNA 
in the form of episomes appear as linear molecules of 12- 



wo 98/07876 



47 



PCT/GB97/02213 



17.5 kb depending on the size of the test construct, 
(b) Episome Copy Number 

The same quantity of DNA was digested with EcoRI and Dpnl 
and Southern blotted as above. The blot was probed with an 
5 92 0 bp BamHI-EcoRI fragment spanning intron II of the ^- 
globin gene. This results in the detection of a 5.5 kb 
fragment from the three copies of the endogenous ^S-globin 
gene and 7.2 - 12.7kb fragment from the transfected 
constructs . 

10 

EXAMPLE 3 

1. Expression Analysis of /9-Globin Reported Gene Constructs 
Figure 4 (upper panel) shows the autoradiogram from the 
analysis of RNA by an Sl-nuclease protection assay for the 
15 expression of the transfected gene constructs. The sites 
were linked to the /S-globin gene in the EBV-based self- 
replicating episomal expression vector p220.2 and stably 
transfected into K562 cells. Total RNA was analyzed by an 
Sl-nuclease protection assay using a mixture of probes to 
20 simultaneously detect both the /3-globin 5' mRNA from the 
transgene and the endogenously expressed 7-globin mRNA (3» 
7) which acts as an internal reference. Quantitation of the 
results in Figure 4 was by Phosphor imager (Molecular 
Dynamics) and is summarized in Table l. 
25 In this experiment, little or no expression was seen with 
the ^-globin gene alone (no LCR component present in the 
self-replicating episomal vector) (lane /3i) , a vector 
containing either HS2 or HS4 improved expression only by a 
small degree (lanes 2(3 and 4^). In contrast, a vector 
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containing HS3 (lanes 3/3) alone gave expression levels that 
were 10 times greater than a vector containing either HS2 or 
HS4 alone. The level of expression obtained with a vector 
containing HS3 was comparable with that observed with a 
5 vector containing a combination of HS2/HS3 (lanes 23/3) or a 
vector containing a combination of HS3/HS4 (lanes 34)9) • The 
HS2/HS4 vector (lanes 24/8) gave the weakest transcriptional 
activity of the two-site constructs. The highest level of 
transcription was obtained using a vector containing a 

10 combination of all three /3-globin LCR elements (HS2,3,4) 
(lanes 234)9) . This level was at least twice as great as HS3 
alone or the HS2/HS3 and HS3/HS4 two-site constructs. 

This experiment was repeated and the results are shown in 
Figure 6. The results of this repeated experiment again 

15 show that the /9-globin gene alone in unable to express in 
K562 cells (lanes /J) • The addition of either )3LqR HS2 
(lanes 2/?) or HS3 (lanes ^3) , gave significant increases in 
expression to a comparable degree. HS4 (lanes p4) did not 
improve expression above that of the )8-globin gene alone* 

20 The two /9LCR HS site combinations (lanes 230, 34/3 and 24/9) 
all increased expression above that of any of the HS sites 
alone. The combination of all 3 HS sites (lanes 234)3) gave 
the highest levels of expression as seen before. The 
results in Figure 6 essentially confirm the results shown in 

25 Figure 4, and demonstrate expression with HS2 alone. 
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Construct Endogenous/ Endogenous Exogenous 

Exogenous 7 signal fi signal 

X 10^ 



5 HS2, 3,4/9-2 
HS2,3,4/9-l 
HS3/3-1 
HS3^-2 
HS3, 2/3-2 

10 HS4, 2/9-2 
HS4 , 2/3-1 
HS3, 2/9-1 
HS4 ,3/3-2 
HS4, 3/3-1 

15 p-1 

HS2/9-1 
HS2/8-2 
HS4/9-1 



32 
28 
27 
19 
16 
15 
14 
12 
9 
9 
7 
6 
6 
5 



856.338 
581.853 
309.087 
899.079 
342.098 
2407.205 
2737.317 
1181.171 
720.397 
604.471 
1385.387 
433.451 
421.97 
502.573 



273.22 
163.35 

84.487 
168.217 
55.253 
360.629 
377.922 
143.282 
66. 132 
54.556 
93.697 
25.151 
24.077 
25.731 



20 Table I t Expression of the human /3-globin under the 
control of the /7LCR in self-replicating EBV-based episomal 
vectors in K562 cells (results Shown in Figure 4). 
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This table shows t:he guan1:itat:ion of the si nuclease 
protection assay shown in Figure 4, of RNA from K562 cells 
stably transfected with either the human /9-globin gene alone 
or under the control of different combinations of the 
5 DNasel hypersensitive (HS) sites of the fil*CR in the self 
replicating EBV-based vector p220*2 (Figure 3) using a 
Molecular Dynamics phosphor imager • 

EXAMPLE 4 

10 The human ^S-globin gene either alone (fi) or under the 
control of /3LCR HS2 (20), HS3 (3)9) and HS2 , 3 and 4 (5/?) in 
the EBV-based, self -replicating episomal vector p220.2 (see 
Figures 2 and 3), were stably transfected into K562 (Panel 
A) and HeLa (Panel B) cells by electroporation (see text) . 

15 Supercoiled plasmid DNA (50/zg) was mixed with 2 x 10^ cells 
and electroporated with the Bio Rad Gene Pulser set to 
deliver a single pulse of 960/xF at 300V. Stably transfected 
cells were selected with 250/xg hygromycin. Total RNA was 
extracted from the stably transfected pools that were 

20 generated and analysed for transgene expression by an Sl- 
nuclease protection assay using end-labelled DNA probes. 
The p-globin mRNA from the transfected gene was detected 
with the same 5' probe (S^fi) as shown in Figure 4. 
Detection of 7-globin mRNA with a 5' exon II probe giving 

25 a protected fragment of 207 nucleotides (5*7), acted as an 
internal reference in K562 cells. The constitutive and 
ubiguitously expressed mRNA for hnRNPA2 was probed as an 
internal reference in the HeLa cell derived samples (giving 
an 122 nucleotide Sl-protected fragment, A2) . The results 
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are shown in Figure 5. 

The result:s show that an episomal vector carrying HS3 alone 
is expressed tissue-specifically, i.e., in K562 cells but 
not in HeLa cells, whereas an episomal vector carrying HS2 
5 alone or carrying HS2 , 3 and 4 is not expressed tissue- 
specifically, as each is expressed in both K562 and HeLa 
cells. Therefore, the presence of HS2 in an episomal vector 
results in expression of the gene of interest but not in a 
tissue-specific manner, whereas the presence of HS3 alone 

10 (i.e, in the absence of HS2) results in expression of the 
gene of interest in a tissue-specific manner. Therefore, 
a classical enhancer, such as HS2 (localized to a NF-E2/AP- 
1 dimer binding site within its core), (Tuan et. al., Proc. 
Natl. Acad. Sex. USA 86:2554-2558 (1989); Ney et. al.. Genes 

15 Dev. 4:993-1006 (1990a); and Ney et al., Nuc. Acids Res. 
18:6011-6017 (1990b) is distinguishable from an LCR or LCR 
component specifying tissue-specificity by the inability of 
the former to confer tissue specific transcription according 
to the invention. 
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1. A self -replicating episoxnal DNA expression vector for 
expressing a gene of interest in a host cell in a tissue- 
restricted manner, the vector comprising: 

5 (a) a self-replicating origin of replication; and 

(b) an LCR, or component thereof, which when operatively 
linked to a gene of interest and present in a host cell 
directs expression of said gene in a tissue-restricted 
manner • 

10 

2. The self-replicating episomal DNA expression vector 
of claim 1, further comprising a gene of interest 
operatively linked to the LCR, or component thereof. 

15 3. The self -replicating episomal DNA expression vector 
of claim 2 wherein the component of an LCR is a component of 
the i9-globin LCR consisting essentially of HS3* 

4. The self -replicating episomal DNA expression vector 
20 of claim 2 wherein the LCR, or component thereof is the /9- 

globin LCR or component thereof excluding site HS2 . 

5. The self -replicating episomal DNA expression vector 
of claim 2 wherein the component of an LCR is a component of 

25 the /9-globin LCR consisting essentially of HS3 and HS4. 

6. The self -replicating episomal DNA vector of any one 
of the previous claims, wherein the origin of replication 
is a viral origin of replication. 



wo 98/07876 



55 



PCT/GB97/02213 



7. The self -replicating episomal DNA expression vector 
of claim 6 wherein the viral origin of replication is an 
origin of replication from Epstein-Barr virus. 

5 

8. The self -replicating episomal DNA expression vector 
of any one of the previous claims, further comprising a 
sequence encoding a replication factor required for 
replication of the expression vector in a host cell. 

10 

9. The self -replicating episomal DNA expression vector 
of claim 8 wherein the sequence encoding the replication 
factor is selected from the group consisting of a sequence 
encoding EBNA-1 of Epstein-Barr virus, a sequence encoding 

15 El of papilloma virus, and a sequence encoding E2 of 
papilloma virus. 

10. The self -replicating episomal DNA expression vector 
of any one of the previous claims, further comprising an 

20 antibiotic resistance gene for selecting cells in culture 
stably transfected with the expression vector. 

11. The self -replicating episomal DNA expression vector 
of any one of the previous claims, further comprising a 

25 eukaryotic transcription termination sequence placed between 
the LCR and the gene of interest and operative to prevent 
transcription therebetween. 
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12. A pair of vectors comprising a self-replicating 
episomal DNA expression system for expressing a gene of 
interest in a host cell in a tissue-restricted manner, the 
pair of vectors comprising: 

5 i. a first vector comprising 

(a) an origin of replication; 

(b) an LCR, or component thereof, which when 
operatively linked to a gene of interest and present in a 
host cell directs expression of said gene in a tissue- 

10 restricted manner; and 

(c) a cloning site for a gene of interest; and 
ii. a second vector comprising 

(a) an origin of replication; and 

(b) a sequence encoding a replication protein, said 
15 replication protein being necessary for replication of said 

origin of replication. 

13. The pair of vectors of claim 12, further comprising 
a gene of interest operatively linked to the LCR, or 

20 component thereof. 

14. The pair of vectors of claim 12 or claim 13 wherein 
the component of an LCR is a component of the /3-globin LCR 
consisting essentially of HS3. 

25 

15. The pair of vectors of claim 12 or claim 13 wherein 
the LCR, or component thereof is the ^-globin LCR or 
component thereof excluding site HS2. 
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16. The pair of vectors of claim 12 or claim 13 wherein 
the component of an LCR is a component of the )S-globin LCR 
consisting essentially of HS3 and HS4. 

5 17. The pair of vectors of any one of claims 12 to 16 
wherein the origin of replication is a viral origin of 
replication . 

18. The pair of vectors of claim 17, said viral origin of 
10 replication being from Epstein-Barr virus 

19. The pair of vectors of any one of claims 12 to 16 
wherein the sequence encoding the replication factor is 
selected from the group consisting of a sequence encoding 

15 EBNA-1 of Epstein-Barr virus, a sequence encoding El of 
papilloma virus, and a sequence encoding E2 of papilloma 
virus . 

20. The pair of vectors of any one of claims 12 to 19, 
20 wherein each said first and second vector further 

comprising, individually, an antibiotic resistance gene for 
selecting cells in culture stably transfected with the 
expression vector. 

25 21. The pair of vectors of any one of claims 12 to 20 
wherein, said first vector further comprising a eukaryotic 
transcription termination sequence placed between the LCR 
and the cloning site for the gene of interest. 
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22. A method for expressing a gene of Interest in cells 
of a specific tissue-type comprising administering a self- 
replicating episomal DNA expression vector of any one of 
claims 1 to 11 or a pair of vectors of any one of claims 12 

5 to 21 to a mammal, 

23. A method of obtaining persistent, tissue-specific 
expression of a gene of interest in a host cell in culture, 
comprising culturing a host cell transfected with the vector 
of any one of claims 1 to 11 or the pair of vectors of any 
one of claims 12 to 21. 

24. A transgenic animal containing cells which contain the 
expression vector of any one of claims 1 to 11 or the pair 
of vectors of any one of claims 12 to 21. 

25. A method for identifying an LCR or component thereof 
which when comprised in an episomal DNA expression vector, 
operatively linked to a gene of interest and present in a 
host cell directs expression of said gene in a tissue- 
restricted manner, comprising: 

i. testing the LCR or component thereof by transfecting 
an episomal vector containing the candidate LCR or 
component thereof operatively linked to a marker gene 
into a cell line in which the LCR when integrated is 
active and also into a cell line in which the LCR when 
integrated is inactive; and 
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ii. identifying the LCR or 
active in the cell 
integrated is active. 



component thereof which is only 
line in which the LCR when 
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